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Ensifer meliloti strain RRI128 is an aerobic, motile. Gram-negative, non-spore-forming rod. 
RRI128 was isolated from a nodule recovered from the roots of barrel medic {Medicago 
truncatula) grown in the greenhouse and inoculated with soil collected from Victoria, Australia. 
The strain is used in commercial inoculants in Australia. RRI128 nodulates and forms an effec- 
tive symbiosis with a diverse range of lucerne cultivars {Medicago sativa) and several species of 
annual medic (M. truncatula, IVIedicago littoralis and Medicago tornata), but forms an ineffective 
symbiosis with Medicago polymorpha. Here we describe the features of E. meliloti strain 
RRI128, together with genome sequence information and annotation. The 6,900,273 bp draft 
genome is arranged into 156 scaffolds of 157 contigs, contains 6,683 protein-coding genes and 
87 RNA-only encoding genes, and is one of 100 rhizobial genomes sequenced as part of the 
DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nod- 
ule Bacteria (GEBA-RNB) project. 



Introduction 

Ensifer meliloti strain RRI128 is used in Australia to 
produce commercial peat cultures (referred to as 
Group AL inoculants) mainly for the inoculation of 
lucerne [Medicago sativa L). Lucerne is sown on 
about 600, 000 ha annually (A. Humphries pers. 
com.) and is nearly always inoculated prior to sow- 
ing. RRI128 is also used for the inoculation of 
strand medic [Medicago littoralis Loisel) and disc 
medic [Medicago tornata (L.) Miller), a hybrid of 
the two former species, and bokhara clover 
[Melilotus albus Medik). RRI128 has been used 
commercially since 2000 when it replaced strain 
WSM826 [1]. Strain RRI128 was isolated from a 
nodule from the roots of barrel medic [Medicago 
truncatula Gaertn) growing in the greenhouse and 



inoculated with an alkaline sandy soil [pHcaciz 7.6) 
collected by J. Slatteiy, near Tempy, Victoria. 

The strain was selected for use in commercial inoc- 
ulants following assessment of its nitrogen fixation 
capacity (effectiveness), growth on acidified agar 
and saprophytic competence in an in-situ soil study 
[2], with supporting data of satisfactory perfor- 
mance at ten field sites. Additional testing has 
shown RRI128 to be effective on 28 cultivars of lu- 
cerne (Ballard unpub. data). It also forms effective 
symbiosis with a range of strand and disc medics 
[2] which show symbiotic affinity with lucerne 
[3,4]. 

Soil acidity has long been recognized as a con- 
straint to lucerne nodulation [5] with some evi- 
dence that strains of E. meliloti have less acidity 
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tolerance than Ensifer medicae, possibly due to 
their association with Medicago species that favor 
neutral to alkaline soils [6]. With RRI128, con- 
straints to lucerne nodulation are observed around 
pH 5. Nodulation of lucerne seedlings inoculated 
with RRI128 was 42% at pH 5.0 in solution culture 
experiments [7] and observed to decline rapidly at 
field sites where pHcaciz was below 4.7 (Ballard, 
unpub. data). Other strains [e.g. SRDI672) have in- 
creased lucerne nodulation in solution culture at 
pH 4.8 (61% cf. 12% of lucerne seedlings with nod- 
ules) but are probably approaching the limit of 
acidity tolerance forE. meliloti [8]. 

Stable colony morphology and cell survival on seed 
make RRI128 amenable to commercial use. RRI128 
produces colonies of consistent appearance and 
with moderate polysaccharide when grown on 
yeast mannitol agar, enabling easy visual assess- 
ment of culture purity. It differs in this regard from 
the strain it replaced (WSM826) which produced 
'dry' and 'mucoid' colony variants, in common with 
many of the strains that nodulate lucerne and med- 
ic [9]. When applied correctly RRI128 has been 
shown to survive at more than 10,000 cells per lu- 
cerne seed at six weeks after inoculation [10]. Good 
survival may well be characteristic of E. meliloti, 
since former inoculant strain WSM826 is equally 
competent in this regard [11,12]. 

Here we present a preliminary description of the 
general features ofE. meliloti strain RRI128 togeth- 
er with its genome sequence and annotation. 

Classification and general features 

Ensifer meliloti strain RRI128 is a motile, non- 
sporulating, non-encapsulated. Gram-negative rod 



in the order Rhizobiales of the class 
Alphaproteobacteria. The rod-shaped form varies 
in size with dimensions of approximately 0.5 |im 
in width and 1.0-2.0 |im in length (Figure lA). It is 
fast growing, forming colonies within 3-4 days 
when grown on TY [13] or half strength Lupin 
Agar (VzLA) [14] at 28°C. Colonies on VzLA are 
opaque, slightly domed and moderately mucoid 
with smooth margins (Figure IB). 

Minimum Information about the Genome Se- 
quence (MIGS) is provided in Table 1. Figure 2 
shows the phylogenetic neighborhood of Ensifer 
meliloti strain RRI128 in a 16S rRNA gene se- 
quence based tree. This strain has 100% sequence 
identity (1366/1366 bp) at the 16S rRNA se- 
quence level to the fully sequenced E. meliloti 
Sml021 [30] and 99% 16S rRNA sequence 
(1362/1366 bp) identity to the fully sequenced E. 
med/cae strain WSM419 [31]. 

Symbiotaxonomy 

Ensifer meliloti strain RRI128 forms nodules on 
(Nod+) and fixes N2 (Fix+) with Medicago sativa, 
Melillotus albus and Trigonella balansae (Boiss. 
and Reuter). It also forms effective symbiosis with 
several species of annual medic (M. truncatula, M. 
littoralis and M. tomato) that happen to be closely 
related to each other based on their ability to be 
hybridized [5] and morphological and nucleotide 
sequence analyses of their relatedness [32]. 
RRI128 forms ineffective (white) nodules with 
Medicago polymorpha, a species that is generally 
recognized to have a more specific rhizobial re- 
quirement for effective symbiosis than Medicago 
sativa and Medicago littoralis [4,33] (Table 2). 





1 mm 



Figure 1. Images of Ensifer me/z/of/ strain RRI128 using (A) scanning electron microscopy and (B) light mi- 
croscopy to show the colony morphology on TY plates. 
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Table 1. Classification and general features of Ensifer meliloti strain RRI128 

according to the MIGS recommendations [15,161 



MIGS ID Property 



Term 



Evidence code 



Domain Bacteria 
Phylum Proteobacteria 



J AS [15,16] 
TAS [1 7] 







Class Alphaproteobacteria 


TAS [18[ 




Current classification 


Order Rhizobiales 
Family Rhizobiaceae 
Genus Ensifer 
Species Ensifer meliloti 
Strain RRI128 


TAS [19] 
TAS [20] 
TAS [21, 
TAS [23,: 




Gram stain 


Negative 


IDA 




Cell shape 


Rod 


IDA 




Motility 


Motile 


IDA 




Sporulation 


Non-sporulating 


NAS 




Temperature range 


Mesophile 


NAS 




Optimum temperature 


28°C 


NAS 




Salinity 


Non-halophile 


NAS 


MlGS-22 


Oxygen requirement 


Aerobic 


IDA 




Carbon source 


Varied 


NAS 




Energy source 


Chemoorganotroph 


NAS 


MIGS-6 


Habitat 


Soil, root nodule, on host 


IDA 


MIGS-15 


Biotic relationship 


Free living, symbiotic 


IDA 


MlGS-14 


Pathogenicity 


Non -pathogenic 


NAS 




Biosafety level 


1 


TAS [25] 




Isolation 


Root nodule 


IDA 


MlGS-4 


Geographic location 


Tempy, Vict., Australia 


IDA 


MlGS-5 


Soil collection date 


Circa 1995 


IDA 


MIGS-4.1 


Latitude 


-35.1833 


IDA 


MIGS-4.2 


Longitude 


142.3833 


IDA 


MIGS-4.3 


Depth 


0-10 cm 


IDA 


MIGS-4.4 


Altitude 


Not reported 





Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author State- 
ment (i.e., a direct report exists in the literature); NAS: Non-traceable Author 
Statement (i.e., not directly observed for the living, isolated sample, but based on 
a generally accepted property for the species, or anecdotal evidence). These evi- 
dence codes are from the Gene Ontology project [26]. 
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57 



57 



72 



99 



Ensifer meliloti GVPV12 (Gi08912) 
Ensifer meliloti AK83 (Gc0 1810)* 
Ensifer meliloti Sm1021 {Gc00059)* 
Ensifer meliloti SM1 1 (CP001 830 GcOl 686)* 
Ensifer meliloti Mlalz-I (Gi08913) 
Ensifer meliloti RRI128 (Gi08915) 
Ensifer meliloti AK5B (Gi07577) 
Ensifer meliloti MVII-I (Gi08914) 
Ensifer meliloti CIAM1775 (Gi08844) 
Ensifer meliloti LMG 6133"r(X67222) 
Ens/fer meWo?/ WSM1 022 (Gi0891 6) 

Ensifer meliloti AH4^ (Gi08911) 

Ensifer medicae WSM419 (Gc00590)* 
Ensifer medicae WSM1369 (Gi08907) 
Ensifer medicae WSM 1 11 5 (Gi08906) 
Ensifer medicae Di28 (Gi08905) 
Ensifer medicae WSM244 (Gi08916) 
Ensifer medicae WSM4191 (Gi08903) 
Ensifer medicae A321 t {L39882) 
Ensifer arboris LMG 14919 (Gi08822) (syn: HAMBI 1552"^) 
Ensifer saheli LMG 7837"^ (X68390) 



94 



71 



51 



62 



Ensifer kostiense LMG 19227T (AM 181 748) 



Ensifer sp. TW10 (Gi08835) 
Ensifersp. WSM1721 {Gi08904) 



76 



Ensifer chiapanecum ITTG S70 (EU286550) 

Ensifer mexicanum ITTG R7'' (DQ41 1930) 
Ensifer terangae LMG 7834^ (X68388) 



0.002 

Figure 2. Phylogenetic tree showing the relationship of Ensifer meliloti strain RRI128 (shown in bold) with some of 
the root nodule bacteria in the order Rliizobiales based on aligned sequences of the 16S rRNA gene (1,307 bp in- 
ternal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were per- 
formed using MEGA [27], version 5.05. The tree was built using the maximum likelihood method with the General 
Time Reversible model. Bootstrap analysis [28] with 500 replicates was performed to assess the support of the 
clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database ac- 
cession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD 
[29]. Published genomes are indicated with an asterisk. 
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Table 2. Compatibility of RRn28 with various Medtago and allied genera for nodulation (Nod) and N2-fixation (Fix). 
Species name Cultivar or line Common Name Growth Type Nod Fix Reference 



Medtago sa tiva *2 8 c ul ti va rs 



Lucerne, Alfalfa 



Perennial 



[2] 



M. Iktoralis 



Harbinger, Herald, Angel 



Strand medic 



Annual 



[2] 



M. tornata 



Tornafield, Rivoli 



Disc medic 



Annual 



[2] 



M. tomataxiktoralis Toreador 



Hybrid disc medic 



Annual 



[2] 



M. truncatula 



Jester 



Barrel medic 



Annual 



IDA 



M. polymorpha Scimitar 



Burr medic 



Annual 



+(w) 



IDA 



Trigonella balansae SA5045, S A32 999, S A3302 5 Sickle fruited fenugreek Annual 



[34] 



Melilotus alb us 



SA1 991 7, SA3562 7, SA34665 Bokhara clover 



Biennial 



IDA 



*28 cultivars tested: Aquarius, Aurora, Cropper 9, Cuff 101, Eureka, Genesis, Hallmark, Hunterfield, Hunter River, 
Jinderra, ML 99, PL 55, PL 60, PL 69, Prime, SARD! Five, SARDI Seven, SARDI Ten, Sceptre, Sequel, Sequel-HR, 
Siriver, Trifecta, UQL1, Venus, WL525HQ, 54Q53 and 57Q75. 

(w) indicates white nodules. 

IDA: Inferred from Direct Assay; evidence code from the Gene Ontology project [26] 



Genome sequencing and annotation 
Genome project history 

This organism was selected for sequencing on tlie 
basis of its environmental and agricultural rele- 
vance to issues in global carbon cycling, alterna- 
tive energy production, and biogeochemical im- 
portance, and is part of the Community Sequenc- 
ing Program at the U.S. Department of Energy, 
Joint Genome Institute (JGI) for projects of rele- 
vance to agency missions. The genome project is 
deposited in the Genomes OnLine Database [29] 
and an improved-high-quality-draft genome se- 
quence in IMG/GEBA. Sequencing, finishing and 



annotation were performed by the JGI. A summary 
of the project information is shown in Table 3. 

Growth conditions and DNA isolation 

Ensifer meliloti strain RRI128 was cultured to mid 
logarithmic phase in 60 ml of TY rich medium on a 
gyratory shaker at 28°C [35]. DNA was isolated 
from the cells using a CTAB (Cetyl trimethyl am- 
monium bromide] bacterial genomic DNA isola- 
tion method [36]. 
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Table 3. Genome sequencing project information for Ensifer meliloti strain RRI128 
MIGS ID Property Term 



MIGS-31 Finishing quality 

MIGS-28 Libraries used 

MIGS-29 Sequencing platforms 

MlGS-31.2 Sequencing coverage 

MlGS-30 Assemblers 

MlGS-32 Gene calling methods 
Genbank ID 



High-Quality-Draft 
1 X lllumina Std library 
lllumina HiSeq2000 
285x lllumina 

with Allpaths, version r39750, Velvet 1 .1 .04 
Prodigal 1.4 
ATYPOOOOOOOO 
Genbank Date of Release September 5, 2013 
GOLD ID Gi08915 
GenBank ID X67222 
Database: IMG-GEBA 251 32 37091 

Project relevance Symbiotic N2 fixation, agriculture 



Genome sequencing and assembly 

The genome of Ensifer meliloti strain RRI128 was 
sequenced at the Joint Genome Institute QGI] using 
lllumina [37] technology. An lllumina standard shot- 
gun library was constructed and sequenced using 
the lllumina HiSeq 2000 platform, which generated 
13,085,546 reads totaling 1,962 Mb of lllumina data. 

All general aspects of library construction and se- 
quencing performed at the JGI can be found at the 
JGI user home [36]. All raw lllumina sequence data 
was passed through DUK, a filtering program de- 
veloped at JGI, which removes known lllumina se- 
quencing and library preparation artifacts 
(Mingkun, L., Copeland, A. and Han, J., unpublished). 
The following steps were then performed for as- 
sembly: (1) filtered lllumina reads were assembled 
using Velvet [38], version 1.1.04, (2) 1-3 Kb simu- 
lated paired end reads were created from Velvet 
contigs using wgsim [39], (3) lllumina reads were 
assembled with simulated read pairs using 
AUpaths-LG [40] (version r39750). 

Parameters for assembly steps were: 

(1 ) Velvet (Velvet optimizer params: -v -s 51 — e 
71 -i 2 -t 1 -f "-shortPaired -fastq $FASTQ" - 
-o "-ins_length 250 -min_contig_Igth 500") 

(2) wgsim (-e 0 -1 76 -2 76 -r 0 -R 0 -X 0,) (3) 
Allpaths-LG 

( P repa re Allpaths I np uts :P H RE D 64= 1 
PL01DY=1 FRAGCOVERAGE=125 
JUMPCOVERAGE=25 LONGjUMPCOV=50, 
RunAllpath-sLG: THREADS=8 
RUN=stdshredpairs TARGETS=standard 
VAPIWARNONLY=TrueOVERWRlTE=True). 



The final draft assembly contained 157 contigs in 
156 scaffolds. The total size of the genome is 6.9 
Mb and the final assembly is based on 1,962 Mb of 
lllumina data, which provides an average 285x 
coverage of the genome. 

Genome annotation 

Genes were identified using Prodigal [41] as part 
of the Oak Ridge National Laboratory genome an- 
notation pipeline. The predicted CDSs were trans- 
lated and used to search the National Center for 
Biotechnology Information (NCBI) non-redundant 
database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, 
COG, and InterPro databases. These data sources 
were combined to assert a product description for 
each predicted protein. Non-coding genes and 
miscellaneous features were predicted using 
tRNAscan-SE [42] RNAMMer [43], Rfam [44], 
TMHMM [45], and SignalP [46]. Additional gene 
prediction analyses and functional annotation 
were performed within the Integrated Microbial 
Genomes (IMG-ER) platform [47]. 

Genome properties 

The genome is 6,900,273 nucleotides with 61.98% 
GC content (Table 4) and comprised of 156 scaf- 
folds (Figures 3a,3b,3c,3d,3e). From a total of 
6,770 genes, 6,683 were protein encoding and 87 
RNA only encoding genes. The majority of genes 
(78.79%) were assigned a putative function whilst 
the remaining genes were annotated as hypothet- 
ical. The distribution of genes into COGs functional 
categories is presented in Table 5. 
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Table 4. Genome Statistics for Ensifer meliloti strain RRI128 



Attribute 


Value % 


of Total 


Genome size (bp) 


6,900,2 73 


100.00 


DNA coding region (bp) 


5,931,61 1 


85.96 


DNA G+C content (bp) 


4,2 76,906 


61 .98 


Number of scaffolds 


156 




Number of contigs 


157 




Total gene 


6,770 


100.00 


RNA genes 


87 


1 .29 


rRNA operons 


1 * 




Protein-coding genes 


6,683 


98.71 


Genes with function prediction 


5,334 


78.79 


Genes assigned to COGs 


5,314 


78.49 


Genes assigned Pfam domains 


5,505 


81.31 


Genes with signal peptides 


569 


8.40 


Genes with transmembrane helices 


1,483 


21.91 


CRISPR repeats 


0 





*2 copies of 5S, 1 copy of 16S and 1 copy of 23S rRNA 




ufjrifij'ii 



1 1 1 II 



Figure 3a. Graphical map of YU7DRAFT_scaffold_0.1 of the genome of Ensifer 
meliloti strain RRI128. From bottom to the top of each scaffold: Genes on for- 
ward strand (color by COG categories as denoted by the IMG platform). Genes 
on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs 
red, other RNAs black), GC content, GC skew. 



I I I III I I 1 1 I I I I ' I I II 




I I I I 



I I I I I I 



Figure 3b. Graphical map of YU7DRAFT_scaffold_1 .2 of the genome of Ensifer 
me///ot/ strain RR1128. From bottom to the top of each scaffold: Genes on forward 
strand (color by COG categories as denoted by the IMG platform). Genes on re- 
verse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, 
other RNAs black), GC content, GC skew. 
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Figure 3c. Graphical map of YU7DRAFT_scaffold_2.3 of the genome of 
Ensifer me/;7ot; strain RRn28. From bottom to the top of each scaffold: 
Genes on forward strand (color by COG categories as denoted by the 
IMG platform), Genes on reverse strand (color by COG categories), 
RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, 
GC skew. 




Figure 3d. Graphical map of YU7DRAFT_scaffold_3.4 of the genome 
of Ensifer meliloti strain RRn28. From bottom to the top of each scaf- 
fold: Genes on forward strand (color by COG categories as denoted 
by the IMG platform). Genes on reverse strand (color by COG catego- 
ries), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC 
content, GC skew. 




o ifl - - 

Figure 3e. Graphical map of YU7DRAFT_scaffold_4.5 of the genome 
of Ensifer meliloti strain RRn28. From bottom to the top of each scaf- 
fold: Genes on forward strand (color by COG categories as denoted 
by the IMG platform). Genes on reverse strand (color by COG catego- 
ries), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC 
content, GC skew. 
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Table 5. Number of protein coding genes of Ensifer meliloti stra\n RRI128 associated 
with the general COG functional categories 

Code Value %aRe COG Category 



J 


202 


3.41 


Translation, ribosomal structure and biogenesis 


A 


0 


0.00 


RNA processing and modification 


K 


520 


8.78 


Transcription 


L 


2 72 


4.59 


Replication, recombination and repair 


B 


2 


0.03 


Chromatin structure and dynamics 


D 


47 


0.79 


Cell cycle control, mitosis and meiosis 


Y 


0 


0.00 


Nuclear structure 


V 


61 


1.03 


Defense mechanisms 


T 


237 


4.00 


Signal transduction mechanisms 


M 


294 


4.97 


Cell wall/membrane biogenesis 


N 


75 


1.27 


Cell motility 


Z 


0 


0.00 


Cytoskeleton 


w 


1 


0.02 


Extracellular structures 


u 


116 


1.96 


Intraceilulartrafficking and secretion 


o 


186 


3.14 


Posttranslational modification, protein turnover, chaperones 


c 


355 


6.00 


Energy production conversion 


G 


594 


10.03 


Carbohydrate transport and metabolism 


E 


673 


11.37 


Amino acid transport metabolism 


F 


108 


1.82 


Nucleotide transport and metabolism 


H 


197 


3.33 


Coenzyme transport and metabolism 


1 


216 


3.65 


Lipid transport and metabolism 


P 


306 


5.17 


Inorganic ion transport and metabolism 


Q 


168 


2.84 


Secondary metabolite biosynthesis, transport and catabolism 


R 


705 


11.91 


General function prediction only 


S 


585 


9.88 


Function unknown 




1,456 


21.51 


Not in COGS 
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